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S eco nd-generati on image cod in g: an overview 
M. M. Reid, R. J. Millar, N. D. Black 

March 1997 ACM Computing Surveys (CSUR), volume 29 issue l 

Additional Information: full c i t a t ion , ab s tra ct, references, i nd ex term s , 
review 



Full text available: ^ pdf(1 2,23 MB) 



This article gives an overview of a diverse selection of currently used second-generation 
image coding techniques. These techniques have been grouped into similar categories in 
order to allow a direct comparison among the varying methods. An attempt has been made, 
where possible, to expand upon and clarify the details given by the original authors. The 
relative merits ans shortcomings of each of the techniques are compared and contrasted. 
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D ata c lusterin g : a r evi ew 

A. K. Jain, M. N. Murty, P. J. Flynn 

September 1999 ACM Computing Surveys (CSUR), Volume 31 issue 3 

r- .. A x -, u. 0i -coc nA i^ D \ Additional Information: full citation, abstract, references, citings, index 
Full text available: w\ pdf(636.2 4 KB) - — — — 

^ " ter ms , r ev iew 

Clustering is the unsupervised classification of patterns (observations, data items, or 
feature vectors) into groups (clusters). The clustering problem has been addressed in many 
contexts and by researchers in many disciplines; this reflects its broad appeal and 
usefulness as one of the steps in exploratory data analysis. However, clustering is a difficult 
problem combinatorially, and differences in assumptions and contexts in different 
communities has made the transfer of useful generic co ... 

Keywords: cluster analysis, clustering applications, exploratory data analysis, incremental 
clustering, similarity indices, unsupervised learning 
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February 1995 Proceedings of the 1995 ACM 23rd annual conference on Computer 
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A survey of imag e re gist ration techniques 
Lisa Gottesfeld Brown 

December 1992 ACM Computing Surveys (CSUR), Volume 24 issue 4 

_ hi — Mtr , Mnx Additional Information: full citation, abstract, references, citings, index 

Full text available:^fi5J0 MB) temTsT^ew 

Registration is a fundamental task in image processing used to match two or more pictures 
taken, for example, at different times, from different sensors, or from different viewpoints. 
Virtually all large systems which evaluate images require the registration of images, or a 
closely related operation, as an intermediate step. Specific examples of systems where 
image registration is a significant component include matching a target with a real-time 
image of a scene for target recognition, mon ... 

Keywords: image registration, image warping, rectification, template matching 



CgmjDuMioM^ 

Paul Suetens, Pascal Fua, Andrew J. Hanson 

March 1992 ACM Computing Surveys (CSUR), volume 24 issue l 

r- .. i. . 0 Jf/D07ym Additional Information: full citation, abstract, refers 

Full text available: l@pdf(6,37.MB} ; . 

^ terms , review 

This article reviews the available methods for automated identification of objects in digital 
images. The techniques are classified into groups according to the nature of the 
computational strategy used. Four classes are proposed: (1) the simplest strategies, which 
work on data appropriate for feature vector classification, (2) methods that match models 
to symbolic data structures for situations involving reliable data and complex models, (3) 
approaches that fit models to the photometry and ... 

Keywords: image understanding, model-based vision, object recognition 



On f ace detection in the compressed domain 
Huitao Luo, Alexandras Eleftheriadis 

October 2000 Proceedings of the eighth ACM international conference on Multimedia 

Full text available: ^| pdf(948.31 KB) Additional Information: full citation , abstract , references , index terms 

We propose a fast face detection algorithm that works directly on the compressed DCT 
domain. Unlike the previous DCT domain processing designs that are mainly based on skin- 
color detection, our algorithm analyzes both color and texture information contained in the 
DCT parameters, therefore could generate more reliable detection results. Our texture 
analysis is mainly based on statistical model training and detection. A number of 
fundamental problems, e.g., block quantization, prepro ... 

Keywords: DCT, JPEG, MPEG, face detection 



7 Omni- fa c e de te ct i on fo r v i deo/ima ge c onten t descri p t i o n 
Gang Wei, Ishwar K. Sethi 

November 2000 Proceedings of the 2000 ACM workshops on Multimedia 

tr ... * . IM a , H OOMm Additional Information: full citation, ab^ract^ 

Full text available: 113 pdf( 1.33 M B) - — — 
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An omni-face detection scheme for image/video content description is proposed in this 
paper. It provides the ability to extract high-level features in terms of human activities 
rather than low-level features like color, texture and shape. The system relies on an omni- 
face detection system capable of locating human faces over a broad range of views in color 
images or videos with complex scenes. It uses the presence of skin-tone pixels coupled with 
shape, edge pattern and face-specific features ... 

Keywords: content-based retrieval, face detection, image annotation, side-view faces, 
skin-tone filtering 



Efficient algorithms for g eome tric optimizati o n 
Pankaj K. Agarwal, Micha Sharir 

December 1998 ACM Computing Surveys (CSUR), volume 30 issue 4 

r- „ . ^ -. u. 0 -,A ixox Additional Information: full citation, abstract, references, citings, index 

Full text available: pdf(57 7. 7 4 KB) terms ~ 

We review the recent progress in the design of efficient algorithms for various problems in 
geometric optimization. We present several techniques used to attack these problems, such 
as parametric searching, geometric alternatives to parametric searching, prune-and-search 
techniques for linear programming and related problems, and LP-type problems and their 
efficient solution. We then describe a wide range of applications of these and other 
techniques to numerous problems in geometric optim ... 

Keywords: clustering, collision detection, linear programming, matrix searching, 
parametric searching, proximity problems, prune-and-search, randomized algorithms 



9 Model-based recognition in robot vision 
Roland T. Chin, Charles R. Dyer 

March 1986 ACM Computing Surveys (CSUR), volume 18 issue l 

,- „ . . u, 0i MA *A Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdf(4.9 4 MB) . : ~ 

]£ -^ t erms , rev iew 

This paper presents a comparative study and survey of model-based object-recognition 
algorithms for robot vision. The goal of these algorithms is to recognize the identity, 
position, and orientation of randomly oriented industrial parts. In one form this is 
commonly referred to as the "bin-picking" problem, in which the parts to be recognized are 
presented in a jumbled bin. The paper is organized according to 2-D, 2V2-D, and 3-D object 
representations, which are used as the basis for ... 

10 Content analy sis : A mid-level representation f ra m ewo rk for seman tic sports v i d eo 
analysis 

Ling-Yu Duan, Min Xu, Tat-Seng Chua, Qi Tian, Chang-Sheng Xu 
November 2003 Proceedings of the eleventh ACM international conference on 
Multimedia 

Full text available: || pdf( 1.42 MB) Additional Information: full citation , abst ra ct, references, index Jejnms 

Sports video has been widely studied due to its tremendous commercial potentials. Despite 
encouraging results from various specific sports games, it is almost impossible to extend a 
system for a new sports game because they usually employ different sets of low-level 
features appropriate for the specific games and closely coupled with the use of game 
specific rules to detect events or highlights. There is a lack of internal representation and 
structure to be generic and applicable for many differ ... 

Keywords: events, mid-level representation, semantics, sports video 
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11 A survey on wavelet applications in data mining ' 
Tao Li, Qi Li, Shenghuo Zhu, Mitsunori Ogihara 

December 2002 ACM SIGKDD Explorations Newsletter, volume 4 issue 2 

Full text available: || pdf(330.06 KB) Additional Information: fu ll citation , abstract, references, citings 

Recently there has been significant development in the use of wavelet methods in various 
data mining processes. However, there has been written no comprehensive survey available 
on the topic. The goal of this is paper to fill the void. First, the paper presents a high-level 
data-mining framework that reduces the overall process into smaller components. Then 
applications of wavelets for each component are reviewd. The paper concludes by 
discussing the impact of wavelets on data mining research an ... 

12 Radiance interpolants for accelerated bounded-error ray tracing 
Kavita Bala, Julie Dorsey, Seth Teller 

July 1999 ACM Transactions on Graphics (TOG), volume 18 issue 3 

i— ii i . u, « -ooo czo i^m Additional Information: f ull citation , abstra ct, references, citings, index 
Fu text avai ab e: to pdf(888.58 KB) 

v t erm s, review 

Ray tracers, which sample radiance, are usually regarded as offline rendering algorithms 
that are too slow for interactive use. In this article we present a system that exploits 
object-space, ray-space, image-space, and temporal coherence to accelerate ray tracing. 
Our system uses per-surface interpolants to approximate radiance both interactive and 
batch ray tracers. Our approach explicity decouples the two primary operations of a ray 
tracer— shading and visibility de ... 

Keywords: 4D interpolation, approximation, data structures, error bounds, interactive, 
interval arithmetic, radiance, rendering, rendering systems, visibility 



1 3 Three-dim ens io nal o b ject reco g nit ion I 
Paul J. Besl, Ramesh C. Jain 

March 1985 ACM Computing Surveys (CSUR), volume 17 issue l 

r- .. A . L1 .t.-,-,***™ Additional Information: full citation, abstract, referent 
Full text available: 113 pdf(7.76.M.B) : " : 

terms , review 

A general-purpose computer vision system must be capable of recognizing three- 
dimensional (3-D) objects. This paper proposes a precise definition of the 3-D object 
recognition problem, discusses basic concepts associated with this problem, and reviews the 
relevant literature. Because range images (or depth maps) are often used as sensor input 
instead of intensity images, techniques for obtaining, processing, and characterizing range 
data are also surveyed. 

14 Watermarking: The e f fec t s of i nv isi ble w ater m arking on satellite i m age c^ssiffcato | 
Gregory L. Heileman, Yunlong Yang 

October 2003 Proceedings of the 2003 ACM workshop on Digital rights management 

r— ii • ■ L1 « Jf/ , , c . mv Additional Information: full citation, abstract, references, citings, index 

Full text available: tju pdf (1 .46 MB) ; - 

terms 

Remotely sensed satellite images are an important source of geographical data commonly 
used as input for various types of classification algorithms. For example, these algorithms 
are commonly used to classify earth land cover, analyze crop conditions, assess mineral 
and petroleum deposits, and quantify urban growth. Many vendors of digital images are 
using or are considering the use of invisible watermarking as a means of protecting their 
images from theft or unauthorized usage. Indeed, the use ... 
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Keywords: classification, content protection, information hiding, satellite imagery, 
watermarking 



1 5 S u rveillance: Vid eo retrieval using spatio-te mporal des criptors | 
Daniel DeMenthon, David Doermann 

November 2003 Proceedings of the eleventh ACM international conference on 
Multimedia 

Full text available: pelf (994. 11 KB) Additional Information: full cita tion, abstract, references, jndexterms 

This paper describes a novel methodology for implementing video search functions such as 
retrieval of near-duplicate videos and recognition of actions in surveillance video. Videos are 
divided into half-second clips whose stacked frames produce 3D space-time volumes of 
pixels. Pixel regions with consistent color and motion properties are extracted from these 
3D volumes by a threshold-free hierarchical space-time segmentation technique. Each 
region is then described by a high-dimensional point wh ... 

Keywords: action recognition, content-based indexing and retrieval, object motion, space- 
time segmentation, spatio-temporal descriptors, video retrieval of near-duplicates 



16 From coar s e t o f i n e sk in an d face detection 
Hichem Sahbi, Nozha Boujemaa 

October 2000 Proceedings of the eighth ACM international conference on Multimedia 

Full text available: ^ pdf( 3 48 .Q3 K B) Additional Information: fu l l c it a t io n, ab stract , ref erences , i nd ex terms 

A method for fine skin and face detection is described that starts from a coarse color 
segmentation. Some regions represents parts of human skin and are selection by 
minimizing an error between the color distribution of each region and the output of a 
compression decompression neural network, which learns skin color distribution for several 
populations of different ethnicity. This ANN is used to find a collection of skin regions, which 
is used in a second learning step to provide parameters ... 

Keywords: Bayesian classifier, color segmentation, face detection, image indexing, neural 
networks 



17 Adaptive Hindi OCR using generalized Hausdorff image comparison 
Huanfeng Ma, David Doermann 

September 2003 ACM Transactions on Asian Language Information Processing (TALIP), 

Volume 2 Issue 3 

Full text available: p d f (280.45 KB) Additional Information: full citation, abstract, rMerences, index terms 

We present an adaptive Hindi OCR implemented as part of a rapidly retargetable language 
tool effort. The system includes: script identification, character segmentation, training 
sample creation, and character recognition. In script identification, Hindi words are 
identified from bilingual or multilingual documents based on features of the Devanagari 
script or using Support Vector Machines. Identified words are then segmented into 
individual characters in the next step, where the composite charac ... 

Keywords: Optical character recognition (OCR), document processing, generalized 
Hausdorff image comparison, script identification 

18 

Document and images analysis: INFTY: an integrated OCR syMem fo r mathematica l 
documents 



http://portal.acm.or^ 



9/24/04 



Remits (page 1): ("image segmentation") and cluster and detect and preprocess 



Page 6 of 6 



Masakazu Suzuki, Fumikazu Tamari, Ryoji Fukuda, Seiichi Uchida, Toshihiro Kanahori 
November 2003 Proceedings of the 2003 ACM symposium on Document engineering 

Full text available: pdf( 32 2 .41 KB) Additional Information: f ull c itation, abstract, references, index terms 

An integrated OCR system for mathematical documents, called INFTY, is presented. INFTY 
consists of four procedures, i.e., layout analysis, character recognition, structure analysis of 
mathematical expressions, and manual error correction. In those procedures, several novel 
techniques are utilized for better recognition performance. Experimental results on about 
500 pages of mathematical documents showed high character recognition rates on both 
mathematical expressions and ordinary texts, and suf ... 

Keywords: character and symbol recognition, mathematical OCR, structure analysis of 
mathematical expressions 



1 9 Query processin g in a multime dia docu ment system I 
Elisa Bertino, Fausto Rabbiti, Simon Gibbs 

January 1988 ACM Transactions on Information Systems (TOIS), volume 6 issue l 

^ -. u. 0 nullum Additional Information: full citation, abstract, references, citings, index 
Full text available: 113 pdf(2.94 MB) - " . ~ - ~ ^ 

" terms, reyjew 

Query processing in a multimedia document system is described. Multimedia documents are 
information objects containing formatted data, text, image, graphics, and voice. The query 
language is based on a conceptual document model that allows the users to formulate 
queries on both document content and structure. The architecture of the system is outlined, 
with focus on the storage organization in which both optical and magnetic devices can 
coexist. Query processing and the different strategies ... 

20 Do cumen t image und er sta ndin g 
Sargur N. Srihari 

November 1999 Proceedings of 1986 ACM Fall joint computer conference 

Full text available: Qpdf(1.38 M B ) Additional Information: full cita tion , referenc es, ci t i n gs, i n d ex t e rms 
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21 Progre s s in Pict u r e Process i n g: 19 6 9-71 
Azriel Rosenfeld 

June 1973 ACM Computing Surveys (CSUR), Volume 5 issue 2 

Full text available: 1f| pdf(2.34 M B) Additional Information: f u l l cit ation, refer en c es , c i ti n gs, index terms 



22 Novelt y and to pic ch an ge: Do mai n-i ndependent text se gmenta tion us i n g aniso t ro pic 
diffusi on a nd d ynamic p ro gra mmin g 
Xiang Ji, Hongyuan Zha 

July 2003 Proceedings of the 26th annual international ACM SIGIR conference on 
Research and development in informaion retrieval 

Full text available: ^ pdf(171.61 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents a novel domain-independent text segmentation method, which 
identifies the boundaries of topic changes in long text documents and/or text streams. The 
method consists of three components: As a preprocessing step, we eliminate the document- 
dependent stop words as well as the generic stop words before the sentence similarity is 
computed. This step assists in the discrimination of the sentence semantic information. 
Then the cohesion information of sentences in a document o ... 



Keywords: anisotropic diffusion, document-dependent stop words, dynamic programming, 
text segmentation 



23 Algori thm i c t ra nsfo rm a tions i n th e i m ple m entation of K- mea ns c luste r i n g on 
reconfigurable hardware 

Mike Estlick, Miriam Leeser, James Theiler, John J. Szymanski 

February 2001 Proceedings of the 2001 ACM/SIGDA ninth international symposium on 

Field programmable gate arrays 

c..n*~* -i «m , m r C n^m Additional Information: full citation, abstract, references, citings, index 

Full text available: TO pdf(255^95 KB) ~ 

terms 

In mapping the k-means algorithm to FPGA hardware, we examined algorithm level 
transforms that dramatically increased the achievable parallelism. We apply the k-means 
algorithm to multi-spectral and hyper-spectral images, which have tens to hundreds of 
channels per pixel of data. K-means is an iterative algorithm that assigns assigns to each 
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pixel a label indicating which of K clusters the pixel belongs to.K-means is a common 
solution to the segmentation of multi ... 

24 An integrated color-spatial approach to content- basedjmag e retrieval 
Wynne Hsu, S. T. Chua, H. H. Pung 

January 1995 Proceedings of the third ACM international conference on Multimedia 

Full text available: [1| htm( 38.02 KB) Additional Information: full citation , references , citings, index terms 



Keywords: color retrieval, content-based retrieval, image segmentation, spatial retrieval 



25 Picture Processing by Computer 
Azriel Rosenfeld 

September 1969 ACM Computing Surveys (CSUR), volume l issue 3 

Full text available: ^ pdf(2.69 MB) Additional Information: full citation , references , citings, index terms 



26 Fast image retrieval using color-spatial information 
Beng Chin Ooi, Kian-Lee Tan, Tat Seng Chua, Wynne Hsu 

May 1998 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 7 Issue 2 

Full text available: ^ pdf(496. 55 K B) Additional Information: ful l c i ta tion, abstract, in d ex te r m s 

In this paper, we present an image retrieval system that employs both the color and spatial 
information of images to facilitate the retrieval process. The basic unit used in our 
technique is a single-colored cluster, which bounds a homogeneous region of that color in 
an image. Two clusters from two images are similar if they are of the same color and 
overlap in the image space. The number of clusters that can be extracted from an image 
can be very large, and it affects the accuracy of ret ... 

Keywords: Color-spatial information, Content-based retrieval, Sequenced multi-attribute 
tree, Single-colored cluster 



27 CVEPS - a compressed video editing and parsing system 
Jianhao Meng, Shih-Fu Chang 

February 1997 Proceedings of the fourth ACM international conference on Multimedia 

Full text available; fg) pdf(1.38 MB) Additional Information: full citation , references , citings , index terms 



28 Face recognition: A literature survey 

W. Zhao, R. Chellappa, P. J. Phillips, A. Rosenfeld 

December 2003 ACM Computing Surveys (CSUR), volume 35 issue 4 

Full text available: pdf(4.28 MB) Additional Information: f ull citation , abstract , references , index terms 

As one of the most successful applications of image analysis and understanding, face 
recognition has recently received significant attention, especially during the past several 
years. At least two reasons account for this trend: the first is the wide range of commercial 
and law enforcement applications, and the second is the availability of feasible technologies 
after 30 years of research. Even though current machine recognition systems have reached 
a certain level of maturity, their success is ... 
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29 Voronoi diagrams — a survey of a fundamental geometric data structure 
Franz Aurenhammer 

September 1991 ACM Computing Surveys (CSUR), volume 23 issue 3 

Full text available: |j| pdf(5.18 MB) Additional Information: full citation , referen ces, citings, index terms 



Keywords: cell complex, clustering, combinatorial complexity, convex hull, crystal 
structure, divide-and-conquer, geometric data structure, growth model, higher dimensional 
embedding, hyperplane arrangement, k-set, motion planning, neighbor searching, object 
modeling, plane-sweep, proximity, randomized insertion, spanning tree, triangulation 



30 Special issue on ICML: Coupled clustering: a method for detecting structural 
c o rresp ondenc e 

Zvika Marx, Ido Dagan, Joachim M. Buhmann, Eli Shamir 

March 2003 The Journal of Machine Learning Research, volume 3 

Full text available: l f|| pdf(96 7.1 5 KB ) Additional Information: full ci tat i o n, abs tract, index terms 

This paper proposes a new paradigm and a computational framework for revealing 
equivalencies (analogies) between sub-structures of distinct composite systems that are 
initially represented by unstructured data sets. For this purpose, we introduce and 
investigate a variant of traditional data clustering, termed coupled clustering, which outputs 
a configuration of corresponding subsets of two such representative sets. We apply our 
method to synthetic as well as textual data. Its achievement ... 

31 An efficient core-area detec ti o n algorithm for fast nois e-fre e imag e que ry p rocess i ng 
Khanh Vu, Kien A. Hua, Due A. Tran 

March 2001 Proceedings of the 2001 ACM symposium on Applied computing 

Full text available: fifl pdf(461.28 KB ) Additional Information: LilLcitatLgn, refer enc es, citing s , i ndex t erms 



Keywords: content-based image retrieval, core-area detection algorithm, 
indexing/retrieval, noise-free queries 



32 Image-based objects 

Manuel M. Oliveira, Gary Bishop 

April 1999 Proceedings of the 1999 symposium on Interactive 3D graphics 

Full text available: || pdf(1.Q4 MB) Additional Information: full citation , re ferences , citings , index terms 



Keywords: 3-D warp, image-based objects, image-based rendering 



33 O PT ICS: order ing poin t s t o ide n tif y th e c l u s terin g str u ct ur e 
Mihael Ankerst, Markus M. Breunig, Hans-Peter Kriegel, Jorg Sander 

June 1999 ACM SIGMOD Record , Proceedings of the 1999 ACM SIGMOD international 

conference on Management of data, volume 28 issue 2 
Full text available: *g]pdf(1.77 MB ) Additional Information: ful l c i tation , abstract, rcjtaences, citings, index 
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terms 

Cluster analysis is a primary method for database mining. It is either used as a stand-alone 
tool to get insight into the distribution of a data set, e.g. to focus further analysis and data 
processing, or as a preprocessing step for other algorithms operating on the detected 
clusters. Almost all of the well-known clustering algorithms require input parameters which 
are hard to determine. but have a significant influence on the clustering result. Furthermore, 
for many real-data sets there doe ... 

Keywords: cluster analysis, database mining, visualization 



34 A JP EG co de c a d a ptive to re gio n importance 

Jiying Zhao, Yoshihisa Shimazu, Koji Ohta, Rina Hayasaka, Yutaka Matsushita 

February 1997 Proceedings of the fourth ACM international conference on Multimedia 

Full text available: Qpdf(1.47 MB) Additional Information: full citation, references, indextexms 



Keywords: JPEG, adaptive codec, fuzzy reasoning, human visual system, region 
importance 



35 Spe ci al issu e on Mac h in e lear n i n g met ho ds for text and imag e s : Mat ching words and 
pictures 

Kobus Barnard, Pinar Duygulu, David Forsyth, Nando de Freitas, David M. Blei, Michael I. 
Jordan 

March 2003 The Journal of Machine Learning Research, volume 3 

Full text available: ^| p d f(789.Q 4 KB ) Additional Information: full citation, abstrac t, citings , ind ex terms 

We present a new approach for modeling multi-modal data sets, focusing on the specific 
case of segmented images with associated text. Learning the joint distribution of image 
regions and words has many applications. We consider in detail predicting words associated 
with whole images (auto-annotation) and corresponding to particular image regions (region 
naming). Auto-annotation might help organize and access large collections of images. 
Region naming is a model of object recognition as a process ... 

36 Hierarchical mesh d ec omposition u s in g fuzzy clu st e ri n g and cuts 
Sagi Katz, Ayellet Tal 

July 2003 ACM Transactions on Graphics (TOG), volume 22 issue 3 

Full text available: ^ pdf(3.89 MB) Additional Information: full cit ation, abst r a ct, re fe r enc es 

Cutting up a complex object into simpler sub-objects is a fundamental problem in various 
disciplines. In image processing, images are segmented while in computational geometry, 
solid polyhedra are decomposed. In recent years, in computer graphics, polygonal meshes 
are decomposed into sub-meshes. In this paper we propose a novel hierarchical mesh 
decomposition algorithm. Our algorithm computes a decomposition into the meaningful 
components of a given mesh, which generally refers to segment ... 

Keywords: control-skeleton extraction, mesh decomposition, mesh segmentation 



37 Learnable visual keywords for image classification 
Joo-Hwee Lim 

August 1999 Proceedings of the fourth ACM conference on Digital libraries 

Full text available: *g] pdf(380.75 KB) Additional Information: full citation , references , citings , index terms 
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38 Image Ca te g or i zation by Le arning a nd Reaso ning with Regions B 
Yixin Chen, James Z. Wang 

August 2004 The Journal of Machine Learning Research, volume 5 
Full text available: ^ pdf( 1.31 MB) Additional Information: f ull cita ti o n, abstrac t 

Designing computer programs to automatically categorize images using low-level features is 
a challenging research topic in computer vision. In this paper, we present a new learning 
technique, which extends Multiple-Instance Learning (MIL), and its application to the 
problem of region-based image categorization. Images are viewed as bags, each of which 
contains a number of instances corresponding to regions obtained from image 
segmentation. The standard MIL problem assumes that a bag is labeled p ... 

39 A multimodal learning i nte rf ace fo r grounding spok en la n gua g e in sensory perceptions jjj 
Chen Yu, Dana H. Ballard 

July 2004 ACM Transactions on Applied Perception (TAP), volume l issue l 

Full text available: ^pdfd.73 M B) Additional Information: full citati o n , ab s t ract , ref e rences, index terms 

We present a multimodal interface that learns words from natural interactions with users. 
In light of studies of human language development, the learning system is trained in an 
unsupervised mode in which users perform everyday tasks while providing natural language 
descriptions of their behaviors. The system collects acoustic signals in concert with user- 
centric multisensory information from nonspeech modalities, such as user's perspective 
video, gaze positions, head directions, and hand moveme ... 

Keywords: Multimodal learning, cognitive modeling, multimodal interaction 



40 Special issue on spatial da t abase systems: A sem antic mo d e ling a pproach for image Q 
retri eval by co ntent 

Wesley W. Chu, Ion T. Ieong, Ricky K. Taira 

October 1994 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 3 Issue 4 

Full text available: ^ pdf(1 .68 MB) Additional Information: ful l cita t i on , abstract, references, citings 

We introduce a semantic data model to capture the hierarchical, spatial, temporal, and 
evolutionary semantics of images in pictorial databases. This model mimics the user's 
conceptual view of the image content, providing the framework and guidelines for 
preprocessing to extract image features. Based on the model constructs, a spatial 
evolutionary query language (SEQL), which provides direct image object manipulation 
capabilities, is presented. With semantic information captured in the model, spa ... 

Keywords: image, medical, multimedia databases, spatial query processing, temporal 
evolutionary query processing 
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41 I n terac tive contr ol of av a tars animated w it h h uma n motion data 

Jehee Lee, Jinxiang Chai, Paul S. A. Reitsma, Jessica K. Hodgins, Nancy S. Pollard 

July 2002 ACM Transactions on Graphics (TOG) , Proceedings of the 29th annual 

conference on Computer graphics and interactive techniques, volume 21 issue 3 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: fB pdf(8.00 MB) 



Real-time control of three-dimensional avatars is an important problem in the context of 
computer games and virtual environments. Avatar animation and control is difficult, 
however, because a large repertoire of avatar behaviors must be made available, and the 
user must be able to select from this set of behaviors, possibly with a low-dimensional input 
device. One appealing approach to obtaining a rich set of avatar behaviors is to collect an 
extended, unlabeled sequence of motion data appropria ... 

Keywords: avatars, human motion, interactive control, motion capture, virtual 
environments 



42 Alg or ith ms: S ignif ican t fac e t retrieval f o r re a l - time 3D so und rende ring in c o mp lex 
virtual environments 
Chris Joslin, IMadia Magnenat-Thalmann 

October 2003 Proceedings of the ACM symposium on Virtual reality software and 
technology 

Full text available: ^ pdf( 367.6 3 KB) Additional Information: full citation, abst rac t, refer ences, index terms 

Sound rendering requires that many different aspects are considered simultaneously, 
especially when rendering a real-time virtual environment. In 3D sound rendering, much 
the same as for graphics, one of the major influencing factors is the number of reflective 
polygons in a scene and due to the increase in the ability of most common graphics cards 
this number can now be very high, especially when scene designers produce an optimum 
scene using other optimizing tools such as Polygon Cruncher or R ... 

Keywords: bounding-box, scene segmentation, sound rendering, virtual environments 



43 Extending c ase-based r ea s o n i ng b y discover ing an d us i n g imag e fe aturegjpJVF 
Igor Jurisica, Janice Glasgow 



http://portal.acm.org/results.cfm?query=%20%28%22image%20segme 9/24/04 



Respite, (p^ge 3): ("image segmentation") and cluster and detect and preprocess 



Page 2 of 6 



March 2000 Proceedings of the 2000 ACM symposium on Applied computing 

Full text available: ||| pdf(8 6 2 .40 KB) Additional Information: McitetfeQ. r efe r ences , ind ex t erms 



Keywords: case-based and image-based reasoning, image-feature extraction, in-vitro 
fertilization (IVF), knowledge discovery 



44 Automatic text recognition for video indexing 
Rainer Lienhart 

February 1997 Proceedings of the fourth ACM international conference on Multimedia 

Full text available: ^g pdf(1.15 MB) Additional Information: full citation , references , citings , index terms 

Keywords: OCR, character recognition, character segmentation, video content analysis, 
video indexing, video processing 



45 Modeling focus of a tte ntio n for m eetin g indexing 
Rainer Stiefelhagen, Jie Yang, Alex Waibel 

October 1999 Proceedings of the seventh ACM international conference on Multimedia 
(Part 1) 

Full text available: ^ pdf(L94 MB) Additional Information: full citatio n , references, c it ing s , index terms 



46 Visibility s orting and com pos iting without s p litting fo r image l ayer decomposition s 
John Snyder, Jed Lengyel 

July 1998 Proceedings of the 25th annual conference on Computer graphics and 
interactive techniques 

Full text available: ^ pdf(591.53 KB) Additional Information: full citation , references , citings , index terms 



Keywords: compositing, kd-tree, nonsplitting layered decomposition, occlusion cycle, 
occlusion graph, sprite, visibility sorting 

47 Projecti ve and v iew-dependent text ures: Gig a Wa l k : i nter active wa lk t h rough of complex Q 
en vironments 

William V. Baxter, Avneesh Sud, Naga K. Govindaraju, Dinesh Manocha 

July 2002 Proceedings of the 13th Eurographics workshop on Rendering 

Full text available: |||pdf(4. 81 MB) Additional Information: fu l l c ita tion, abs tr a ct, references, citings 

We present a new parallel algorithm and a system, GigaWalk, for interactive walkthrough of 
complex, gigabyte-sized environments. Our approach combines occlusion culling and levels- 
of-detail and uses two graphics pipelines with one or more processors. GigaWalk uses a 
unified scene graph representation for multiple acceleration techniques, and performs 
spatial clustering of geometry, conservative occlusion culling, and load-balancing between 
graphics pipelines and processors. GigaWalk has been use ... 

Keywords: engineering visualization, interactive display systems, levels-of-detail, 
occlusion culling, parallel rendering 
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48 The Quadtree and Related Hierarchical Data Structures 
Hanan Samet 

June 1984 ACM Computing Surveys (CSUR), volume 16 issue 2 

Full text available: ^ pdf(4.87 MB) Additional Information: full citation , references , citings , index terms 



49 Meetin g experience: Experiential me eting system 
Ramesh Jain, Pilho Kim, Zhao Li 

November 2003 Proceedings of the 2003 ACM SIGMM workshop on Experiential 
telepresence 

Full text available: ^ pdf(388.84 KB ) Additional Information: full citat io n , abstract, r eference s, ind ex term s 

We are developing experiential meeting systems to allow people to be tele-present in a 
remote meeting and to be able to review proceedings of a meeting or of several meetings 
using all the data recorded in a meeting. We consider this as a problem in management and 
experiential access to all multimedia data acquired in a meeting. The data includes video, 
audio, presentations, text material, databases and websites related to people and the 
discussions in the meeting, and any other data or informat ... 

Keywords: data event, elemental event and domain event, event, event based data 
processing, experiential systems, meeting 



50 Research track: Eliminating noisy information in Web pages for data mining | 
Lan Yi, Bing Liu, Xiaoli Li 

August 2003 Proceedings of the ninth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: ^ pdf(468.87 KB) Additional Information: fu ll c i tation , ab stract, references, index terms 

A commercial Web page typically contains many information blocks. Apart from the main 
content blocks, it usually has such blocks as navigation panels, copyright and privacy 
notices, and advertisements (for business purposes and for easy user access). We call these 
blocks that are not the main content blocks of the page the noisy blocks. We show that the 
information contained in these noisy blocks can seriously harm Web data mining. 
Eliminating these noises is thus of great importance. In this pa ... 

Keywords: Web mining, noise detection, noise elimination 



51 Think globally, fit locally: u nsupervised learn ing of low dime nsional m a nifo lds 
Lawrence K. Saul, Sam T. Roweis 

December 2003 The Journal of Machine Learning Research, volume 4 

Full text available: ^ pd f( 2 . 91 MB) Additional Information: full citation, ab s t r a c t, index terms 

The problem of dimensionality reduction arises in many fields of information processing, 
including machine learning, data compression, scientific visualization, pattern recognition, 
and neural computation. Here we describe locally linear embedding (LLE), an unsupervised 
learning algorithm that computes low dimensional, neighborhood preserving embeddings of 
high dimensional data. The data, assumed to be sampled from an underlying manifold, are 
mapped into a single global coordinate system of lowe ... 

52 ai a nd com putational log ic and i m a ge analysi s ( A l): Conc aten ate feature ex tr act i on for 
robu s t 3D ellipti c o bje ct l o c a li za tion 

Yuichi Motai, Akio Kosaka 
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March 2004 Proceedings of the 2004 ACM symposium on Applied computing 

Full text available: *g| pdf(371.28 K B) Additional Information: full c itation , abstrac t, ref erences , index terms 

Developing an efficient object localization system for complicated industrial objects is an 
important, yet difficult robotic task. To tackle this problem, we have developed a system 
consisting first of a vision model acquisition editor, where the object salient features are 
acquired through a human-in-the-loop approach. Subsequently, two feature extraction 
algorithms, region-growing and edge-grouping, are applied to the object scene. Finally, by 
Kalman filter estimation of a proper ellipse repre ... 

Keywords: 3D robot vision system, Kalman filter estimation, elliptic edge grouping, 
human-in-the-loop segmentation, salient feature extraction 



53 Comparing in situ mRNA expression patterns of drosophila embryos 
Hanchuan Peng, Eugene W. Myers 

March 2004 Proceedings of the eighth annual international conference on 
Computational molecular biology 

Full text available:*^ pdf(377. 74 KB) Additional Information: ful l ci ta tion, abstract, ref er ences, in dex te r ms 

In situ staining of a target mRNA at several time points during the development of a D. 
melanogaster embryo gives one a detailed spatio-temporal view of the expression pattern 
of a given gene. We have developed algorithms and software for analyzing a database of 
such images with the goal of being able to identify coordinately expressed genes and 
further our understanding of c/s-regulatory control during embryogenesis. Our approach 
combines measures of similarity at bo ... 

Keywords: drosophila, embryogenesis, gaussian mixture model, gene expression, image 
matching, in situ hybridization 



54 High performance clustering based on the similarity join 

Christian Bohm, Bernhard Braunmuller, Markus Breunig, Hans-Peter Kriegel 
November 2000 Proceedings of the ninth international conference on Information and 
knowledge management 

Full text available: pdf(1 34 . 5 7 KB ) Additional Information: full ci ta tion, refe rence s, ci tin gs, index te rms 



Keywords: clustering, data mining, database primitives, multidimensional index structure, 
similarity join 



55 MDM/KDD2002: multimedia data mining between promises and problems 
Simeon J. Simoff, Chabane Djeraba, Osmar R. ZaTane 
December 2002 ACM SIGKDD Explorations Newsletter, volume 4 issue 2 

Full text available: Qpdfd 05.37 KB) Additional Information: full citation , abstract , references 

This report presents a brief overview of multimedia data mining and the corresponding 
workshop series at ACM SIGKDD conference series on data mining and knowledge 
discovery. It summarizes the presentations, conclusions and directions for future work that 
were discussed during the 3 rd edition of the International Workshop on Multimedia Data 
Mining, conducted in conjunction with KDD-2002 in Edmonton, Alberta, Canada. 

Keywords: digital media, image content mining, knowledge discovery, multimedia data 
mining, multimedia databases, sound analysis, video analysis 
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Frank Hoffmann, Klaus Kriegel, Carola Wenk 

June 1998 Proceedings of the fourteenth annual symposium on Computational 
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Full text available: ^p d f( 1.Q7 MB ) Additional Information: full cit ation, references, citings, indexjejims 



57 Adaptive estimation of op tic al flow from general ob ject m o tion 
Zhaoxin Pan, Joseph J. Pfeiffer 

April 1992 Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing: 
technological challenges of the 1990's 

Full text available: ^ pdf ( 679.63 KB ) Additional Information: ful l cita tio n , reference s, ind ex terms 



Keywords: artificial intelligence, computer vision, image processing, motion detection, 
optical flow 



58 Image retrieval, users and usability: A search engine for historical manuscript images 9 
Toni M. Rath, R. Manmatha, Victor Lavrenko 

July 2004 Proceedings of the 27th annual international conference on Research and 
development in information retrieval 

Full text available: ^ pd f(346.95 KB ) Additional Information: full citation , a bstr a c t, references, inii^tj^ms 

Many museum and library archives are digitizing their large collections of handwritten 
historical manuscripts to enable public access to them. These collections are only available 
in image formats and require expensive manual annotation work for access to them. 
Current handwriting recognizers have word error rates in excess of 50% and therefore 
cannot be used for such material. We describe two statistical models for retrieval in large 
collections of handwritten manuscripts given a text query. Bo ... 

Keywords: handwriting retrieval, historical manuscripts, relevance models 



59 S earc hing in me tric s paces 

Edgar Chavez, Gonzalo Navarro, Ricardo Baeza-Yates, Jose Luis Marroquin 
September 2001 ACM Computing Surveys (CSUR), volume 33 issue 3 

,- „* . , Ll ^ ,jr/r^/> avi Additional Information: full citation, abstract, references, citings, index 

Full text available: 113 pdf( 91 6.04 KB) — - - - — ~ 

^ terms 

The problem of searching the elements of a set that are close to a given query element 
under some similarity criterion has a vast number of applications in many branches of 
computer science, from pattern recognition to textual and multimedia information retrieval. 
We are interested in the rather general case where the similarity criterion defines a metric 
space, instead of the more restricted case of a vector space. Many solutions have been 
proposed in different areas, in many cases without cros ... 

Keywords: Curse of dimensionality, nearest neighbors, similarity searching, vector spaces 
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Full text available: ■g| p df( 61 3.63 KB ) 
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